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5-H ' We consider a two-hop wireless network where a transmitter communicates with a receiver via M relays 

^ with an amplify-and-forward (AF) protocol. Recent works have shown that sophisticated linear processing such as 

beamforming and distributed space-time coding (DSTC) at relays enables to improve the AF performance. However, 
the relative utility of these strategies depend on the available channel state information at transmitter (CSIT), which 
in turn depends on system parameters such as the speed of the underlying fading channel and that of training and 
feedback procedures. Moreover, it is of practical interest to have a single transmit scheme that handles different CSIT 
scenarios. This motivates us to consider a unified approach based on DSTC that potentially provides diversity gain 
with statistical CSIT and exploits some additional side information if available. Under individual power constraints 
' at the relays, we optimize the amplifier power allocation such that pairwise error probability conditioned on the 

O , available CSIT is minimized. Under perfect CSIT we propose an on-off gradient algorithm that efficiently finds a set 

of relays to switch on. Under partial and statistical CSIT, we propose a simple waterfilling algorithm that yields a 
non-trivial solution between maximum power allocation and a generalized STC that equalizes the averaged amplified 
^ ' noise for all relays. Moreover, we derive closed-form solutions for M ~ 2 and in certain asymptotic regimes that 

, enable an easy interpretation of the proposed algorithms. It is found that an appropriate amplifier power allocation 

' is mandatory for DSTC to offer sufficient diversity and power gain in a general network topology. 

(N 

■ I. Motivation 

O ' 

00 We consider a wireless relay network illustrated in Fig. 1 where a transmitter communicates with a receiver via 
O M relays and each terminal has a single antenna. We let h, g denote the channel vector between the transmitter and 
J> the relays, the channel vector between the relays and the receiver. For its simplicity, we focus on an amplify-and- 
^ forward (AF) protocol [1], [2] in a two-hop communication: in the first T channel uses the transmitter broadcasts a 
codeword, then in the second T channel uses relays amplify and forward the observed codeword by applying some 
linear precoder (to be specified later). We assume that a transmitter and M relays have individual power constraints 
rather than a total power constraint, since in a practical wireless network terminals are physically distributed and 
hence are subject to their own power supplies. 

In a classical AF protocol, the amplifier coefficients have been determined so as to satisfy required power 
constraints [1]. In order to improve the performance of the AF protocol, a large number of recent works have 
considered some additional linear processing at relays [3], [4], [5], [6], [7], [8], [9], [10]. These works can be 
roughly classified into two classes according to their assumption of channel state information at transmitter (CSIT) 
and their objective. The first class assumes perfect CSIT and aims to maximize either the achievable rate or the 
instantaneous receive SNR [3], [4], [5], [6]. The resulting transmit scheme yields beamforming with appropriate 
power allocation. The second class assumes only statistical CSIT and aims to minimize the error probability by 
designing some type of linear precoder [7], [8], [9], [10]. In particular, significant attention has been paid to 
distributed space-time coding (DSTC) in which each relay sends a different column of a STC matrix [11], [8], [9], 
[10]. With a single antenna at each terminal, we hasten to say that the goal of DSTC is to approach full diversity 
gain of M offered by the relay-receiver channels g since the multiplexing gain of the MISO channel under the 
half-duplex constraint is very limited, i.e. 1/2. Notice that an AF based DSTC that achieves the optimal diversity- 
multiplexing tradeoff [12] has been well studied [13], [14], [15], [16]. It clearly appears that a practical utility of 
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these two approaches depends on the available CSIT, which in turn depends on the speed of fading and that of 

feedback and training procedures. Notice that in a two-hop communication model obtaining perfect CSIT is rather 
challenging because the transmitter needs at least a two-step training and feedback process, i.e. first learns h and 
then g, which typically induces additional delay and estimation error. Consequently, the perfect CSIT assumption 
holds only if the underlying fading is quasi-static and a sufficiently fast feedback and training is available. For 
this particular case, the first approach might be useful. On the contrary, if a rate of feedback and training is much 
slower than the coherence time of the channel, the second approach based on the statistical channel knowledge is 
more appropriate. 

The above observation motivates us to find a unified approach that can handle different CSIT scenarios, rather 
than changing a transmit strategy as a function of the quality of side information. To this end, we fix our transmission 
strategy to DSTC that potentially provides diversity gain with statistical CSIT and further power gain if additional 
side information is available. Our goal is not to find the optimal strategy for each CSIT case but to propose a 
unified DSTC scheme that simply adapts the amplifier power allocation to available CSIT. Among a large family 
of DSTC, we consider linear dispersion (LD) codes [17] because they offer desirable performance in terms of 
diversity gain and coding gain [10], [18] and moreover keep the amplified noise white. The latter considerably 
simplifies the power allocation strategy. We assume perfect synchronization between relay terminals and perfect 
channel state information available at the receiver, which is necessary for coherent detection. Under this setting, we 
will address the following question: how does the quality of CSIT impact the amplifier power allocation and the 
resulting performance of the DSTC? To answer the question, we optimize the amplifier power allocation in such 
a manner that the pairwise error probability (PEP) conditioned on CSIT is minimized. Note that the conditional 
PEP is a performance criteria widely used in the Uterature of STC [19], [20], [21] and DSTC [22], [10], [18]. In 
particular, [21] has provided elegant precoder designs that minimize the PEP conditioned on CSIT for orthogonal 
STC. Unfortunately the extension of this work to a two-hop relay network appears very difficult due to the non- 
convexity of the underlying problem. We examine the following CSIT cases : 1) perfect knowledge of the absolute 
value of the entries of h and g (perfect CSIT), 2) perfect knowledge of absolute value of the entries of h and 
statistical knowledge of g (partial CSIT), and 3) statistical knowledge of h and g (statistical CSIT). 

Under perfect CSIT, the PEP minimization reduces to the maximization of an approximate receive SNR. The 
optimal power allocation strategy turns out to be an on-off strategy, whereby some relays are switched off and 
others transmit at maximum available power. We propose an on-off gradient algorithm that efficiently finds the 
optimal set of relays to switch on. Under partial and statistical CSIT, the conditional PEP minimization appears 
very difficult due to the self-interference caused by amplified noise and calls for a good heuristic approximation. 
First, we apply a Laplace-based saddle point approximation of an inherent integral in order to make the problem 
amenable. Since the approximated problem is still non-convex due to averaged amplified noises, we transform it 
into a convex problem via a log transformation [23] assuming a high transmitter power (which is the regime of 
interest). For a new objective function, we propose a very simple waterfilling algorithm that yields a non-trivial 
solution between maximum power allocation and a generalized STC that equalizes the averaged amplified noise, 
i.e. Pi'jg^i with 7g j being the variance of the channel between relay i and the receiver (in a classical STC we 
consider jg^i = 1, Vi). We derive closed-form solutions for M = 2 and in certain asymptotic regimes that enable an 
easy interpretation of the proposed algorithms. It is found that that an appropriate power allocation is mandatory 
for DSTC in order to provide diversity and power gains in a general network topology. 

In order to situate this work in the context of relevant literature, we note that the LD based DSTC for a two-hop 
AF network has been addressed for a single-antenna case [10] and for a multiple antenna case [18]. In both works, 
Jing and Hassibi provided the diversity analysis by optimizing the power partition between the transmitter and the 
relays under the assumptions that the transmitter and M relays are subject to a total power constraint and that both 
channels have unit variance. Clearly, the optimal power partition under this setting, letting the transmitter use half 
of the total power and each relay share the other half, does not hold for a general network topology with unequal 
variances. We make a progress with respect to this since our proposed waterfilling solution can be applied for 
any set of variances under statistical CSIT and moreover handles the partial CSIT case. With perfect CSIT, Jing 
proposed a cooperative beamforming scheme for the same two-hop AF network [5]. Although this beamforming 
scheme provides a non-negligible power gain compared to our on-off power allocation as shown in Section VI, 
we remark that our on-off algorithm is much simpler and can be implemented at the receiver without requiring 
any knowledge at the transmitter. To this end, it suffices that the receiver sends to each relay a feedback of one 
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bit indicating whether to activate or not. Hence, our on-off algorithm might be appealing due to its robustness and 
simplicity despite its suboptimal performance. 

The rest of the paper is organized as follows. After briefly introducing the two-hop network model in Section 
n, we derive the the conditional PEP upper bounds for different CSIT cases in Section m. In Section IV we 
propose efficient algorithms that solve the conditional PEP minimization, namely on-off gradient algorithm for 
perfect CSIT and waterfiUing algorithm for partial and statistical CSIT. We provide some asymptotic properties of 
these algorithms in Section V and numerical examples in Section VI. Finally we conclude the paper in Section 

vn. 



II. System Model 

We consider frequency-flat fading channels and let g = [gi, - ■ ■ ,gM]'^, h = [hi,. . . , HmY' denote the chan- 
nel vector between transmitter and relays, the channel vector between relays and receiver, respectively. We as- 
sume the entries of h and g are i.i.d. zero-mean circularly symmetric complex Gaussian with variance 7^ = 
[7/1! , . . . , , 7p = Ylgi , . . . , 7gM] respectively. The variance of each channel is assumed to capture path-loss and 
shadowing. We assume a block fading model, namely h and g remain constant over a block of 2T channel uses. 
In this paper we do not consider a transmitter-receiver direct link for simplicity. It is well known however that the 
direct link should be taken into account if one aims at optimizing the diversity-multiplexing tradeoff [13], [14], 
[16], [15]. The communication between the transmitter and the receiver is performed in two steps. The transmitter 
first broadcasts a symbol vector s = [si, . . . , st]^ £ C^^^ with E[ss'^] = It and relay i receives 



where Ps is the power of the transmitter and iij ~ N{j^(0, A^qIt) is AWGN. In the second T channel uses, M 
relays amplify and forward the observed codeword by applying a hnear precoder. Namely, the transmit vector Xj 
of relay i is given by 

Xj = QiAiYi (1) 

where qi denote a complex ampUfier coefficient of relay i, Aj G C'^^'^ is a unitary matrix satisfying Af^Aj = It, 
and Xj should satisfy a power constraint, i.e. 

E[||x,||2] = |gi|2E[||yi|p]<rp, 

where the expectation is with respect to nj for a short-term constraint and with respect to both rij and hj for a 
long-term constraint. This in turns imposes a constraint on {qi} such that < Pj where Pj denotes the maximum 
amplifier power of relay i given by 

short-term 



P = I Ps\hi\^+No' 

I Fl^f^iVo' long-term 
The received signal at the final destination is given by 

M M M 

r = ^ giVii -I- w = ^ gihiqiAiS + ^ giqiAiUi + w 

i=l 1=1 i=l 

where w ~ J^£^{0, NqIt) is AWGN at the receiver uncorrelated with {rij}. The received vector can be further 
simplified to 

r = VpISQf + V (2) 

where we let S = [Ais, . . . , Ams] G c^^^ denote a LD codeword, Q = diag(g'i, . . . , qm) is a diagonal matrix 
with M amphfier coefficients, f = [higi, ... , hMguY a composite channel vector, and we let 

M 

V = ^ qigiAiXii + w (3) 
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denote the overall noise whose covariance is given by 

E[vv^] = No (j2 h\^\9i\^ + 1^ It = a^^lr 

It follows that the overall noise seen by the receiver is white and this considerably simplifies the amplifier power 
allocation in the following sections. 

III. Conditional PEP 

With perfect knowledge of both h and g, the receiver can perform Maximum Likelihood decoding ^ by estimating 
a codeword according to 

S = argmin llr - v^SQflP (4) 

se§ 

When the transmitter has only partial knowledge of the channels, it is reasonable to consider the pairwise error 
probability (PEP) conditioned on the available CSIT. In the following we derive the expressions of the conditional 
PEP for three different CSIT cases: 1) perfect CSIT, where the transmitter knows the absolute values of the entries 
of h, g, 2) partial CSIT where the transmitter knows the absolute values of the entries of h and 7^, 3) statistical 
CSIT where the transmitter knows and 7^. Perfect CSIT corresponds to the case of quasi-static fading, while 
statistical CSIT corresponds to the case of fast fading so that the transmitter can track only the second order statistics 
of the chaimel. Finally, partial CSIT is an intermediate case relevant to a time-division duplexing system where the 
transmitter learns perfectly h by reciprocity but only statistically g due to a low-rate feedback. 

A. Perfect CSIT 

The PEP conditioned on h, g for any k ^ I is defined by 

P{Sk^Si\h,g) ^ Pr(||r-y^S,QHg||2 < ||r - VpISfcQHgl pjSfc, h, g) = Pr (^^(S;,, S;) < 

where the composite channel f is decoupled into Hg with H = diag(h), where we define squared Euclidean 
distance between Si and Sfe as 

d2(Sfc, S;) = p,g^H^Q^(Sfc - SifiSk - S,)QHg 

and where k = 2y^Re{v^(Sfe — S;)Qf} is a real Gaussian random variable distributed as 3M^]^(0, 2al(f). In order 
to obtain a upper bound of the PEP, we assume that the term (S^ — S;)^(Sfc — S;) has a full rank M, i.e. the LD 
code achieves a full diversity (for a special case of orthogonal STC this always holds). By letting Xmin denote the 
smallest singular value of (S^ — S;)^(Sfc — S/) over all possible codewords, we obtain the inequality 



rf'(Sik,SO >p.A^i„g^H^Q^QHg 



which yields a Chemoff bound 



P,S.^S,lh,g)<exp(-?.^=i^:|p^). (5) 

Minimizing the RHS of (5) corresponds to maximizing approximated receive SNR, given by 

P.A^ing^H^Q^QHg _ ZZiPihW ... 

where we let r] = XminPs/ and let pi = denote the amplifier power of relay i. Notice that the above 
function depends on the absolute values of channels and of amplifier coefficients. 

*In practice, efficient decoding techniques sucli as sphere decoding can be implemented to achieve near ML results [17]. 
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B. Partial CSIT 

The PEP upper bound conditioned on h, 7^ is obtained by averaging (5) over the distribution of g 



P(Sfe^Si|h,7j 



(a) 



/ 1 

J det(7rdia 



= det 



diag(7g)) 

V 



exp < -g 



H 



jyH^PH 



+ diag(7j M g > dg 



M 



diag(7,)H^PH + lM =n 1 + 

1 + Ei=i IgiPi J i=i\ 1 + Ei=i 



(7) 



where in (a) we let P = diag(pi, . . . ,pm) and apply a Laplace-based saddle point approximation that becomes 
accurate as the number of relays increases without bound (see further Appendix I). This saddle point approximation 
is inspired by the approximation method suggested in [24] to evaluate the expectation of quotients of quadratic 
forms in Gaussian random variables. We remark that, in order to maximize the corresponding cost function in (7), 
only the absolute values of the entries of h are needed. 

C. Statistical CSIT 

The PEP upper bound conditioned on "ih^^g is obtained by averaging (5) over the distribution of h and g 



P(Sfc^Si|7^,7j <E, 



gj - --g 

(a) 



det-^ ( Im + 

M 



ri 



(c) 



g 

M 



tr(G^PG) 

Vlhj\9]?Pj 



diag(7jG^PG 



M 



j=i P' 



-7 + ln(p,) + ( 



-Q 1 + Ei=i IgiPi 



VlgjlhjPj 



Vlgj^hjPj 
1 + E»=i IgiPi 



(8) 



where in (a) we apply the Laplace-based saddle-point approximation of the integral mentioned above, (b) follows 
by noticing that \gi\'^/jgi is an exponential random variable with unit mean and using [25, 3.352] where we write 
Pj = i^'^m'''^',^, and let Ei{x) = denote the exponential integral. In (c) we assume that pj is large 

(77 00) so that 

- (-i)^+Hp.)-' 



kk\ 



= -7 + ln(p,) + O 



1 



Pj 



where 7 is the Euler constant, and finally in (d) we assume In(pj) » 7. 

IV. Power allocation algorithms 
This section proposes efficient power allocation algorithms to optimize (6), (7), and (8). 



A. Perfect CSIT 

Under perfect CSIT, the optimal is obtained by maximizing 



/o(p: 



A Ei=i CtiPi 

1 + Y^iii l^iPi 



— I , , and we let a,- 



(9) 



where pi is subject to the maximum amplifier power of relay i, i.e. pi < Pi 
and Pi = We remark that the linear constraints form a feasible region V composed by M half-spaces with 
2^ — 1 vertices. For M = 2 the feasible region V is a rectangular region with 3 vertices (Pi, 0), (0, P2), (Pi, P2) 
plus the origin. Since this problem is quasi-linear, it is possible to transform it into a linear program. By exploiting 
the structure of the problem, we propose a more efficient algorithm to find the solution. First, we start with the 
following proposition. 
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Proposition 1 The solution to (9) is always found in the one of 2^ — 1 vertices of the feasible region V. 
Moreover, at the solution p*, the entries of the gradient satisfy the following inequahty for z = 1, . . . , M. 



>0, 
<0, 



if p* 
Hp* 







(10) 



1. 



Proof see Appendix II. 

For M = 2 we have a closed form solution of the optimal power allocation as an obvious result of Proposition 
Corollary 1 For M = 2, we find a closed-form solution given by 



ijPl,P2) = < 



(0,P2), 
iPl,P2), 



if 1^1 
if 1^2 
if 1^1 



< 
< 



> 



pAg2\'- 



Ps\hl\^+Pr 
Pr|g2|^ 



(11) 



2\'> 



pAsinhil 



Ps\hl\^+Pr\gi\^+No 



Proof see Appendix m. 

In order to visualize the conditions of activating relay 1 and/or relay 2, we provide a graphical representation of 
the on-off region in Fig. 5. Interestingly, it can be observed there is a minimum value of \hi\'^ in order for relay i 
to be activated. Namely relay 1 is activated independently of relay 2 if 

\hi\ > —ml 

Ps 

which is readily obtained when letting |/i2p oo in the first inequahty of the third condition, i.e. |/tip > 
— P'-Jg^l yj(,g versa. 

For M > 2, the solution does not lead itself to a simple closed form expression. Nevertheless, as a straightforward 
result of Proposition 1 we propose the following algorithm to solve (9). 
On-off gradient algorithm 

1) Initialize p^^^ to an arbitrary vertex G V 



2) 



At iteration n, compute the gradients ^(p*-"^) 
and update 



P 



(n+l) 



pS"^ + v.(p 



-P if ^jiP^""^] 



< and Vi(pW) 



Pi 



in)] 



(12) 



Pi if ^(p^'*)) > 0, and 



x]^ denotes the 



where we let Vi(p^"^) - ^ 
value of X truncated to the interval [a, b]. 
3) Stop if p(") satisfies (10). 

Proposition 2 The on-off gradient algorithm converges to the global maximum. 
Proof See Appendix IV. 

Fig. 2 plots the convergence behavior of the proposed on-off algorithm when we let Ps= Pr = 10 and consider 
equal variances 'yh,i = 7g,i = 1 for all i. The objective values are normalized with respect to the optimal values 
and averaged over a large number of random channel realizations. Fig. 2 as well as other examples show that the 
proposed on-off gradient algorithm converges only after a few iterations irrespectively of M and of the initialization. 

It is worth noticing that this on-off algorithm can be implemented at the receiver without any knowledge at the 
transmitter. To this end, it suffices that the receiver sends to each relay a feedback of one bit indicating whether to 
activate or not. 



B. Partial CSIT 

When the transmitter knows h and 7^, the problem reduces to minimizing (7) or maximizing 

/,(p) = X; In f 1 + rjM'—^ ) (13) 

i=i V ^ + 22j=ii9jPjJ 

The term ^ ^^''^'''P' , similar to pi = , /Dl'^^^'^' defined in (b) of (7) for tiie statistical CSIT case, can be 
interpreted as the contribution of relay i to the receive SNR. With some abuse of notation, we let pi denote 



7 



under partial CSIT. Unfortunately, the function /i is neither concave or convex in p. Nevertheless, 
assuming rj ^ oo (which is the regime of our interest), let us consider a new objective function, given by 



M / 

O-iPi 



(14) 



where we let aj = ri^gi\hi\'^ for notation simplicity. It is well known that the function J(p) can be transformed 
into a concave function through a log transformation [23]. In the following, we use the notation x to express Inx 
for any variable x (equivalently x = e^). The objective function can be expressed in terms of p as 



M 



J(p) = ^ ln(a,eP-) - M In 1 + ^ ^MPjhgj | (15) 
i=i \ j 



Proposition 3 The optimal p that maximizes (15) is given 

Pi = \l^ - 7fli]-oo ' "P^^ ~ 

L /9« J 

where /i* is the water level that is determined as follows. Let tt denote a permutation such that 



(16) 



^7r(l)7g,7r(l) < • ■ ■ < -P7r(M)7g,7r(M) • (17) 

and define /ij = In/Xj, where /Ltj is given by 

l + ELi^M»)7gM0 ^■ = i^...,M. (18) 

j 

The optimal water level /i* is obtained as that the value out of these M possible ones that maximizes the objective 
function, namely 

/2* = arg max J{p.j) (19) 

where J{Ji) is the objective function (15) parameterized by the water level defined in (31). 
Proof see Appendix V. 

Fig. 3 illustrates an example of our waterfiUing solution for the case M = 3. The power curve of relay i increases 
linearly with slope l/7gi and then is bounded at its maximum amplifier power Pi. In this example, relays 1 and 2 
with Pi7gi < /U* are allocated their maximum amplifier powers while relay 3 is allocated Depending on the 
water level, this waterfilUng yields a non-trivial solution between maximum power allocation (pj = Pi,Vz) and a 
generalized STC that equaUzes the averaged ampUfied noise Pi7gi = /Lt* (notice a classical STC considers 7gj = 1 
for all i). Note that the proposed waterfilling approach only requires a search over M values in order to determine 
the water level, and consequently it is extremely simple. 

Corollary 2 For M = 2, we find a closed-form solution given by 



(Pl,P2) 




if P2lg2 > Pllgl + 1 

if P2lg2 < Pllgl " 1 ^^^^ 

Otherwise 



Proof see Appendix VI. 

By expressing the power constraint Pj = ^-p^^qr/Vo ' above allocation policy can be graphically represented 
as a function of | /tip and |/i2p in Fig. 6. Similarly to Fig. 5 for the case of perfect CSIT, there exists a minimum 
value of so that relay i is allocated its maximum amplifier power. Namely, relay i is allocated its maximum 
amphfier power independently of relay j / i if > 2i1mazI^±_ Interestingly, the threshold associated to relay 
i depends only on the z-th chaimel, as opposed of what happened in the perfect CSIT case. This means that the 
power allocation is more selfish under partial/statistical CSIT in order to increase the reliability of the wireless link. 
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C. Statistical CSIT 

When the transmitter only knows the variances 7^, 'jf^ of the chaimels, we minimize the expression (8) which is 
equivalent to maximizing 

/.(p)=E'"( , ;?ir- ) 

i=l V + 22j=iPn9jJ 

where we ignored doubly logarithmic terms and the amplifier power pi of relay i is subject to a long-term individual 
power constraint Pi = ^ ^^\t^ for all i. Again, by performing a log-transformation we obtain precisely the same 
objective function J(p) in (15) where Oj = r]^gi\hi\'^ defined in the previous partial CSIT case is replaced with 
iHgilhi- Hence, the waterfilling solution proposed for the partial CSIT case can be directly applied to the statistical 
CSIT case and needs to be implemented once for a given set of variances Jhilg- The power allocation region for 
M = 2 is given in Fig. 6 where the axes are replaced by jhi and jh2- 

V. Asymptotic behavior 

This section studies the asymptotic behavior of the proposed power allocation algorithms and gives an informal 
discussion on the resulting error rate performance. 

A. Relays close to transmitter — > 00 

We consider the regime where 7h,j ^ 00 or equivalently — ^ 00 for all i at the same rate while treating 
other parameters finite. First we examine a two relay case. Under prefect CSIT, Fig. 5 implies that both relays tend 
to be switched on in this regime. Under partial and statistical CSIT cases, it follows immediately from Fig. 6 that 
two relays shall transmit at their maximum powers. 

For M > 2 the same conclusion can be drawn as a straightforward result of Proposition 1 for the perfect CSIT 
case. The condition for which all M relays are switched on under perfect CSIT is given by 

where the RHS corresponds to the objective value /o(-Pi, • • • , Pm) when letting all relays transmit with maximum 
power. The RHS is upper bounded by 

where (a) follows from P, |/i, p = — f,''^!^^Lf < — and (b) follows from , , i — its" ^ 1- In the Umit of 
— > oo,Vj, both (a) and (b) hold with equality and we have 

M 

\hi\''>-Y.\9j\^^ i = h---,M 
P^ j=i 

so that all relays are allocated their maximum powers. From the upper bound (22) of the objective function, it can 
be expected that the performance of distributed LD code improves for a larger M. Under partial and statistical 
CSIT, the proposed waterfilling tends to allocate the maximum power to each relay. This can be seen immediately 
from the waterfilhng solution depicted in Fig.3. As Pj 0, the values {Pi7p,j} above which the power curves are 
bounded become much smaller than the lowest water level //min = ^/M. This means that all relays are allocated 
the maximum powers. The following remarks are in order; 

1) Since the waterfilling algorithm under partial CSIT and on-off gradient algorithm coincide, both algorithms 
yield the same error performance. This impUes that the knowledge of g has a negligible effect on the 
performance in the regime. 

2) As a final remark, the same behavior can be observed in the following cases. 

• the transmitter power increases ps ^ oo. 

• the variance 7^ of the relay-receiver chaimel decreases, i.e. 7^ — >^ or equivalently l^jp for all i 
at the same rate. 
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B. Relays get close to receiver 7^—^00 

We consider the regime where 70 j 00 or equivalently IgA'^ 00 for all i at the same rate. First we examine 
a two-relay case M = 2. Under perfect CSIT, it can be observed that the threshold values ElML above which 
each relay becomes activated (represented by straight lines in Fig. 5) get large and the on-off algorithm converges 
to relay selection. Similarly, the waterfilling algorithm also tends to allocate only one relay with maximum power 
under partial and statistical CSIT cases as expected from Fig. 6. The only exception is a symmetric variance case 

For M > 2 under perfect CSIT, we next show that the on-off strategy converges to single relay selection as 
l^ip — > 00, Vi To see this, first observe that if the one-off algorithm chooses only one relay to switch on, it selects 
the relay 

I = arg max — nr— 

The objective value is given by (with some abuse of notation in the argument of the cost function) 

where the inequality holds with equality for \gi* ^ 00. If the on-off algorithm activates any arbitrary set of m > 1 
relays, the corresponding objective is upper bounded by 

Y^m l/).|2|«.|2p. Y^m I /, . 1 2 1 1 2 p. , . ™ 
/o(n, ■■■,Pm, OM-mj - 1 ,Y^m i |2p < v^m i |2p " '-^^^ 

^ 2^j=i\9j\ ^3 2^j=i\9j\ ^3 

where in (a) we define 9i = ^„ ^ Lp with < < 1 and V,- Oi = 1. We see that the last expression (24) is 
strictly smaller than (23) for any m and regardless of a set of relays. This implies that as the relays get close to 
the receiver, the on-off algorithm converges to single relay selection. 

The waterfilling algorithm under partial and statistical CSIT lets only one relay transmit with the maximum 
power as we see in the following. Let us first consider the permutation vr given in (17) sorting relays according to 
-P7r(i)7g,7r(i) < P-K{2)lg,-K{2) < ■ ■ < P-K{M)lg,-K{M) with Strict inequalities. As 7^,, 00 for all i, the possible water 
level in (18) is roughly given by jij Ri ^^=1 -P^^wTg.Trw levels tend to be sorted as 

Mmin ^ /il < ^^2 < • • • < Mm = Mmax (25) 

Notice that the inequality jjij < /Xj+i holds if Ylil=i{P'K{3+i)lg,Tr{3+i) ~ Pn{i)lg,TT(i)) > 1- We show that the function 
J is monotonically decreasing for Hi < n < hm and the optimal water level is always given by /Lti- We recall that 
the derivative of J with respect to /i in (32) can be expressed as a function of fi as 

VJ=1 ^ 

where we assumed |C(/i)| > 0. Under the specific order of the water levels given in (25), we can further express 
the derivative V Jj for each interval fij < fi < fij+i for j = 1, . . . , M — 1 such that 

Jif'j - m) 



Since we have VJj < for any interval j, it clearly appears that the function is monotonically decreasing thus 
the waterfilling algorithm allocates maximum power only to the relay 7r(l) by letting = //i. In order to have 
an insight on the error rate perform 
approximated receive SNR value /q. 



an insight on the error rate performance achieved by the waterfilling letting Pi = ^ for all i, we evaluate the 
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where (a) holds with equahty as /xi — >^ oo (Pj7gi oo for all i), and in (c) we define 9i = , If, — with 
< 0j < 1 and 9i = 1. We see that the final expression is dominated by (23) achieved by single relay selection 
under perfect CSIT. The following remarks are in order; 

1) The optimal transmit scheme in this regime is single relay selection that chooses roughly the relay with the 
largest Activating more than one relay becomes highly suboptimal due to large ampUfied noise. 

2) The power allocation under partial and statistical CSIT is the same and equahzes Pi'ygi. 

3) As a final remark, the same behavior can be observed in the following cases. 

• the relay power increases pr —>■ oo. 

• the variance of the transmitter-relay channel decreases 7^ 0. 

which yield Pi\gi\'^ ^ 00 under perfect CSIT and Pi^g^i 00 under partial and statistical CSIT. 

VI. Numerical results 

In this section, we provide some numerical results to illustrate the behavior of the proposed power allocation 
algorithms. Assuming a homogeneous network, we let ps = Pr- We consider BPSK modulation and generate 
randomly a LD code with M = T drawn from an isotropic distribution. 

First, we compare the proposed on-off algorithm with other schemes in a system with M = 2 relays and equal 
variances jhi = lgi = ^ for i = 1,2. Fig. 7 shows the block error probability versus per-relay SNR Pt/Nq with the 
on-off algorithm, network beamforming of [5], and maximum power allocation that lets both relays transmit with 
their peak powers. For a reference we also plot the performance of our waterfiUing algorithm under statistical CSIT. 
We observe that network beamforming outperforms the on-off gradient algorithm by roughly 3 dB by exploiting 
full channel knowledge and that both schemes achieve the same diversity gain. On the contrary, maximum power 
allocation has a substantial performance loss and fails to achieve full diversity gain. This clearly shows that an 
appropriate power allocation is essential for distributed LD code to provide diversity gain. 

Next, we examine how the network topology impacts the proposed power allocation algorithms and the resulting 
BER performance. To model a simple network topology, we consider a unit transmitter-receiver distance and let the 
transmitter-relay distance varies in the range < r < 1. The resulting variances are 7^, = and ^gi = 1/(1 — r)^ 
for all i. For the sake of fair comparison between systems with different M, we assume that the whole network 
power y is equally shared between the transmitter and M relays so that p,. = T/(M + 1). Fig. 8 shows the BER 
performance of the proposed power allocation algorithms with M = 2, 4, 6 and V/Nq = 15 dB along with the 
performance of the direct transmission with a fixed power CP. Fig. 9 shows the averaged allocated power ratio, 
i.e. X^^i lE[pt(i)/^i(0]' o'" equivalently the effective number of relays. The following remarks are in order: 1) 
As the relays get closer to the transmitter r — > 0, the transmitter activates all relays with their maximum power. 
The waterfiUing solution under partial CSIT converges to the on-off gradient algorithm in the limit of r — > 0, 
which implies that the knowledge of g has a negUgible impact on the performance. The result agrees well with 
the analysis provided in subsection V-A. 2) As the relays get closer to the receiver r — > 1, the optimal strategy 
activates only one relay to limit the amphfied noise. As seen in Fig. 9 the on-off gradient algorithm indeed reduces 
to relay selection. On the contrary, the waterfilUng solution equalizes pi7(,i = • • • = pulgM both under partial 
and statistical CSIT, and moreover it converges to the same error performance independently the number of relays. 
Under the given setting where Pi'^gi = • • • = PmIqM, the waterfiUing solution under statistical CSIT lets all relays 
transmit with maximum power. The result is in a good agreement with the analysis of subsection V-B. 

Fig. 10 shows the BER performance versus 7/Nq for M = 2, 4, 8. Here, we randomly choose the relay-receiver 
distances and let 7y=[0.85, 3.17, 1.50, 1.89, 2.06, 2.36, 3.19, 3.99]. The transmitter-relay distance r = 0.5 is fixed 
ilhi = 4 for any i). Compared to the direct transmission, DSTC with our proposed power allocation algorithms 
yields significant diversity gain at moderate to high power regime. Moreover, additional CSIT yields a considerable 
power gain. 

VII. Conclusions 

We considered a two-hop wireless network where M relays aid one transmitter-receiver pair to communicate via 
DSTC together with the AF protocol. In order to study the impact of CSIT on the design and the performance of 
DSTC, we optimized the amplifier power allocation under individual power constraints so that the PEP conditioned 
to the available CSIT is minimized. Under perfect CSIT we proposed the on-off gradient algorithm that efficiently 
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finds a subset of relays to switch on. It turns out that this algorithm can be implemented at the receiver if the receiver 

can send a one-bit feedback to each relay indicating whether to switch on or not. Under partial and statistical CSIT 
we derived a simple waterfiUing algorithm that yields a non-trivial solution between maximum power allocation and 
a generalized STC that equalizes the averaged amplified powers for all relays. Closed-form solutions were derived 
for M = 2 and in certain asymptotic regimes. Namely, when the relays are physically close to the transmitter, the 
on-off algorithm and the waterfiUing algorithm coincide and both let all relays transmit with maximum amplifier 
powers. When the relay are close to the receiver, the on-off algorithm converges to relay selection in order to 
minimize the amplified noise seen by the receiver while the waterfiUing equalizes the averaged ampUfied noise and 
becomes highly suboptimal. The proposed amplifier power allocation algorithms were derived for a particular type 
of linear dispersion STC but can be extended to more general LD code as well as other classes of STC as long as 
the amplified noise remains white. 

Appendix I 
Saddle point approximation 

The objective of this appendix is to justify the approximations (a) in (7) and (8) and to show that the approximation 
is vaUd as the number of relays increases without bound. Let us denote 



g^H^PHg 6=l + g^Pg 



We need to evaluate 



E 



exp ( —7] 



a 



E 



exp —T] 



g^H^PHg 



1 + g//pg 



where the expectation is with respect to the statistics of g. First of all, observe that we can write 

g^H^PHg 



exp I —rj 



1 + gHpg 



lim Xji 

n^oo 



where 



On the other hand. 



ik>0 



l^nl < 



n u , 

^ k\ \b) 

k>0 



< exp 



g^H^PHg 



rj- 



1 + g^Pg 



< exp 



g^H^PHg 



7]- 



gHpg 



< exp r? max Ihjl < oo 



Hence, the bounded convergence theorem ensures that we can write 



E 



exp ( -rj- 



E 



lim X„ 



lim E [Xn] = y ^4^E 



fe>0 



ay 



(27) 



and we can therefore concentrate on the study of the moments 

r. = E[(- 

In particular, we can follow the procedure introduced in [24], which is based on a Laplace approximation of the 
integral. More specifically, in [24] it was shown that a Laplace approximation of about the origin leads to the 
identity 



rfc = E 



E[6] 



(28) 



12 



where both expectations are with respect to g and we have R/. ^ as M ^ oo. Although the procedure is 
somewhat tedious, one can extend this to show that supj^. Rk — >^ 0. As a direct consequence of this theorem, we 
have that as M ^ oo , it holds that^ 



E 



exp I —T] 



a 



^ k\ 

k>0 




Rk 



E 



exp —T] 



'E[b] 



fe>0 



where now 



E 

fe>0 



k\ 



Rk 



< sup Rk exp {—r]) — >■ 
k 



and this justifies the approximation used in the paper. 

Appendix II 
Proof of Proposition 1 

In order to prove Proposition 1, we consider the i-th entry of the gradient of the objective function, given by 

dfo ^ + «i EjVi PjPj - Pi (^jPj 

When we treat the variables fixed, the i-th gradient can be expressed as a function of pi in the form 

(l3 pf+C )^ , where Q > and are constants. Depending on the sign of ^j, the gradient is always negative or 
positive, i.e. the function is monotonically decreasing or increasing in each pi ^. Since the objective function cannot 
be maximized at < pi < Pi, the solution of (9) is achieved only at one of the vertices. The second part follows 
directly from the monotonicity of the function in each component pi. Namely, the solution is achieved by the vertex 
at which the objective function cannot further increase beyond the thresholds. 

Appendix 111 

Proof of Corollary 1 : a closed-form solution of M = 2 under perfect CSIT 
From Proposition 1, we see immediately that the power allocation of two relays depend on the sign of 

6(P2) = ai + Ap2, 6(Pi) = "2 - Api 

where we let A = ai/32 — /3ia2 = |5'i5'2p(|/iiP — l^'2p)- Moreover, it is sufficient to check the sign of and ^2 at 
each vertex to determine the optimum power allocation. Table I summarizes the optimal solution and the conditions 
; the optimal solution is given by (Pi, 0) if and only if ^2{Pi) = 02 — APi < holds while it is given by a vertex 



(0, P2) if and only if we have ^i(-P2) 
inequahties yield (11). 



ai + AP2 < 0. Finally, both relays are activated if ^ < A < ^. These 



Appendix IV 

Proof of Proposition 2 : convergence of on-off gradient algorithm 

We have to first prove that the objective is non-decreasing, i.e. /(p^"^^'') > /(p^"'') for any iteration n. Identifying 
that the update in (12) is nothing than a discrete steepest ascent algorithm with a fixed step size, the objective 
always increases. It remains to prove that the converged point is the global maximum. In other words, the stopping 
criterion above is sufficient to guarantee a global convergence. It is not difficult to see that there is a unique p* 
satisfying the condition (10) such that the signs of the gradients and the powers match. Otherwise, we can always 
increase the objective by switching on (off) the power with a positive (negative) gradient. 

^In the last equality of the following section, one should justify the fact that expectation and sum can be interchanged. This is not difficult 
to see, but the proof is omitted due to space constraints. 

'For = 0, the function is constant in pi, then we let pi = 0. 
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Appendix V 

Proof of Proposition 3 : waterfilling solution under partial CSIT 

Since J(p) is a strictly concave function of p, we solve the KKT conditions which are necessary and sufficient 
for optimality. By letting Aj > a Lagrangian variable associated to the individual power constraint pi < Pi, we 
obtain the KKT conditions given by 

^_ M,,^e.m) ^^^^ ^^^^ 

1 + Ej7fljexp(pj) 

Summing the above equation over all i and defining / = jgj exp{pj) and /x = we obtain 

I = Mfi-l (30) 

It follows that /i is lower bounded by ^u™"^ = jj and upper bounded by /^^^^ = m" ' ■ Plugging (30) into 

(29) and using the inequality Aj > 0, we readily obtain the optimal power given in (16). 

It remains to determine the optimal water level jl* that maximizes the objective function J (note that the individual 
power constraint is always satisfied for any To this end, we define C(/i) and S(/i) as 

e(/i) = {i\pi = ii- e(/x) = {i\pi = Pi} 

Plugging (16) into (15), the function can be expressed in terms of 

J(/i) = |e(A)|A+ Yl ^i-Mln (l + |e(/x)|exp(/i)+ ^ jgieMPi)] +J]lnai (31) 



i6e(/i) \ iee(/i) 



where |C|, |S| denotes the cardinaUty of the set C, C respectively. Since the function J(/x) is strictly concave in p,, 
the optimal jl must satisfy |^ = where 

^ = m)\ ( 1 ^^^^ ^"l (32) 

dn ^ 1 + |e(A)| exp(/i) + E,ee(A) ^MPi) J 

which yields 

exp(u) = (33) 

for |S(/i)| > 0. Notice that for |C(/i)| = 0, it can be shown that the objective function is a monotonically increasing 
concave function and maximized at /x™'^^. By sorting {Pi^gi} in an increasing order according to the permutation 
(17), we remark that the RHS of (33) has at most M possible values in (18) and we choose the optimal jl* 
according to (19). 

Appendix VI 

Proof of Corollary 2 : a closed-form solution of M = 2 under partial CSIT 
Without loss of generaUty, we assume -Pi7g,i < P2lg,2- We recall that the possible values of the water level (18) 
for M = 2 are , „ „ 

„ _ 1 , p ^ „ _ „ma^ _ 1 + A7g,l + P2lg,2 

/-fl — -L + J^llg,! M2 — /-'' — ^ ■ 

First we consider the case fii < ii2- This inequality reduces to P2lg,2 > Pilg,\ + ^nd further yields 

^l7ff,l < Ml < < P2lg,2- 

Obviously we obtain pi = Pi no matter which of the two values is the optimum water level. It is not difficult to 
see that the optimal water level is given by ni by comparing the two objective values J{ili) and J{p2)- Hence the 
amplifier power of relay 2 is p2 = = ^-^-p-^- Next we consider the case //2 < A*i- This inequality is equivalent 
to ^^275,2 < Pilg,i + 1' and yields P2lg,2 < M2- In tms case, the water level (either /xi or ^2) is larger than 

''Some of the M values might be unfeasible if they are not in the domain jl e [/imin, Amax]. 
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Pi7g,i P27g,2- Hence, the algorithm lets both relays transmit at the maximum power, pi = Pi and p2 = P2- 
In summary, we have the following power allocation possibilities: 

. pi = Pi and p2 = < P2 if P2792 > Pi7g,i + 1 

• Pl= Pi and P2 = P2 if P27g2 < Pllg,l + 1 

By symmetry, when Pijgi > P2lg2 we obtain 

. Pl = Pi and P2 = P2 if P2lg2 < Pllg,l < P2lg2 + 1 

• Pi = < Pi and p2 = P2 if Pi7ffi > P2lg2 + 1 

These conditions and the corresponding power allocation are summarized in (20) and depicted in Fig. 4. Since 
the parameter Pi-fgi does not provide an easy interpretation (note that Pj is a function of h), we express the power 
allocation region in terms of 7^, h explicitly. The subregion where only relay 1, 2 is allocated its maximum power 
is given respectively by 

. . Prlg2{\hl\^ + ft) ^ Prlgl{\h2? + i) jVp 

Ps\hl\'^ +Prlgl+ Nq Ps' Ps\h2\'^ +Prlg2+ Nq Pg 

These conditions yield the power allocation region in terms of h in Fig. 6. 
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Fig. 1. A wireless relay network 
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Fig. 2. Convergence of on-off algorithm for different M 
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Fig. 3. Proposed waterfilling solution with M = ?> 
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Fig. 4. Power allocation region as a function of Pi'ygi 
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Fig. 5. Two-relay ON/OFF region under perfect CSIT 
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Fig. 6. Two-relay power region under partial CSIT 
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